Comparative analysis of multiple genome-scale data sets.
Abstract
The ongoing analysis of published genome-scale data sets is evidence that different approaches are required to mine these data completely. We report the use of novel tools for both visualization and data set comparison to analyze yeast gene-expression (cell cycle and exit from stationary phase/G(0)) and protein-interaction studies. This analysis led to new insights about each data set. For example, G(1)-regulated genes are not co-regulated during exit from stationary phase, indicating that the cells are not synchronized. The tight clustering of other genes in the exit-from-stationary-phase data set further indicates that the physiological responses during G(0) exit are separable from cell-cycle events. Comparison of the two data sets showed that ribosomal-protein genes cluster tightly during exit from stationary phase, but are found in three significantly different clusters in the cell-cycle data set. Two protein-interaction data sets were also compared with the gene-expression data. Visual analysis of the complete data sets showed no clear correlation between co-expression of genes and protein interactions, in contrast to published reports examining subsets of the protein-interaction data. Neither two-hybrid study identified a large number of interactions between ribosomal proteins, consistent with recent structural data, indicating that, for both data sets, the number of false-positive interactions identified may be lower than previously thought.
Similar resources
Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms.
We describe a comparative mathematical framework for two genome-scale expression data sets. This framework formulates expression as superposition of the effects of regulatory programs, biological processes, and experimental artifacts common to both data sets, as well as those that are exclusive to one data set or the other, by using generalized singular value decomposition. This framework enabl...
Multi-DICE: R package for comparative population genomic inference under hierarchical co-demographic models of independent single-population size changes
Population genetic data from multiple taxa can address comparative phylogeographic questions about community-scale response to environmental shifts, and a useful strategy to this end is to employ hierarchical co-demographic models that directly test multi-taxa hypotheses within a single, unified analysis. This approach has been applied to classical phylogeographic data sets such as mitochondria...
Integrating 'omics' data sets and biological knowledge: Multiple Factor Analysis as a powerful strategy
The huge amount of data provided by genome-scale technologies makes discernible biological meanings difficult to access. Here, we report a powerful integrative method to combine genome-wide scale data sets and biological knowledge. Multiple Factor Analysis (MFA) is used to investigate jointly large observation data sets from different ’omic’ areas enriched with biological annotations. This mult...
A Data Envelopment Analysis Model with Triangular Intuitionistic Fuzzy Numbers
DEA (Data Envelopment Analysis) is a technique for evaluating the relative effectiveness of decision-making units (DMU) with multiple inputs and outputs data based on non-parametric modeling using mathematical programming (including linear programming, multi-parameter programming, stochastic programming, etc.). The classical DEA methods are developed to handle the information in the form of cri...
GenoList: an integrated environment for comparative analysis of microbial genomes
The multitude of bacterial genome sequences being determined has generated new requirements regarding the development of databases and graphical interfaces: these are needed to organize and retrieve biological information from the comparison of large sets of genomes. GenoList (http://genolist.pasteur.fr/GenoList) is an integrated environment dedicated to querying and analyzing genome data from ...
Journal title:
- Genome research
Volume 12, Issue 10
Pages: -
Publication date: 2002